Jastrow correlation factor for atoms, molecules, and solids 
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A form of Jastrow factor is introduced for use in quantum Monte Carlo simulations of finite 
and periodic systems. Test data are presented for atoms, molecules, and solids, including both 
all-electron and pseudopotential atoms. We demonstrate that our Jastrow factor is able to retrieve 
a large fraction of the correlation energy. 
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>—i . Many-electron wave functions may be accurately and compactly approximated by a product of a small number of 
' Slater determinants and a positive Jastrow correlation factor. The Jastrow factor is an explicit function of the electron- 
electron separations, so that expectation values calculated with a Slater-Jastrow wave function do not separate in the 
electron coordinates. Nevertheless, the variational and diffusion quantum Monte Carlo (VMC and DMC) methods 
permit the use of such explicitly correlated wave functions. 

In VMC, expectation values are calculated using an approximate trial wave function, the integrals being performed 
by a Monte Carlo method. In T)MG^ the imaginary-time Schrodinger equation is used to evolve an ensemble 
^ : of electronic configurations towards the ground state. The fermionic symmetry is maintained by the fixed-node 
Q ' approximation;^ in which the nodal surface of the wave function is constrained to equal that of a trial wave function. 
O , The DMC method gives the energy that would be obtained in a VMC calculation with the same Slater determinants, 

■ but using the best possible Jastrow factor. 

O \ Although the DMC energy is in principle independent of the Jastrow factor, a poor trial wave function increases 
c/2 ■ the statistical error bars and the time-step and population-control biases. When non-local pseudopotentials are used 
within DMC, the locality approximation't^ leads to additional errors which are second order in the error in the trial 
, wave function.^ The expectation values of operators that do not commute with the Hamiltonian are often evaluated 

1 using extrapolated estimation^ the accuracy of the extrapolation depending on the quality of the trial wave function. 
In practice the efficiency and accuracy of both VMC and DMC calculations are critically dependent on the availability 
of high-quality Jastrow factors. 

^ \ Our Jastrow factor is designed for use in atoms, molecules, and solids. We have used it in a variety of systems, 

■ and here we report results on the He, Ne^"*", Ne, and Ni atoms; the NiO and Sill4 molecules; and crystalline Si in 
' the diamond structure. These systems include all-electron and pseudopotential descriptions of atoms, with the total 

number of electrons varying from 2 to 216. We pay particular attention to the issue of cutting off terms in the 
• Jastrow factor at finite ranges, which is desirable because of the local nature of the inhomogeneous correlations in 
many systems, as well as for reasons of computational efficiency in large systems. 

We obtained the values of the free parameters in our Jastrow factors by minimizing the variance of the energyi^ 
All of our QMC calculations were performed using the CASINO package.^ We use Hartree atomic units, h = |e| = 
rUe — 47reo = 1, throughout this article. 

The rest of this paper is organized as follows. In Sec.[TTlwe describe the general form of our Jastrow factor, while in 
Sec, mil we show how the electron-electron and electron-nucleus cusp conditions'^ apply to this form. The behavior of 
$_i the local energy at electron-electron and electron-nucleus coalescence points is discussed in Sec. IIVI Sec. IVl describes 
5^ 1 the Jastrow factor in detail. In Sec. I VII we make further comments on the form of our Jastrow factor, while in Sec. IVIII 
we define our notation for specifying the Jastrow factor and give our criterion for judging its quality. In Sees. IVIIIl4Xll 
we report the results of studies of various systems. Finally, we draw our conclusions in Sec. IXIII 

II. GENERAL FORM OF THE JASTROW FACTOR 

The Slater-Jastrow wave function can be written as 

*({ra, {r,}) = exp [J{{v,], {r,})] D{{v,]) , (1) 



where {vi} and {r/} denote the electron and ion coordinates, respectively, exp [J] is the Jastrow factor, and D denotes 
the Slater part, which depends only implicitly on the {r/}. 
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An accurate and efficient Jastrow factor should possess a number of qualities. The functional form of the Jastrow 
factor should be chosen to reflect the physics of the correlations in the system, and it should be parameterized 
efficiently. The trial wave function must be continuous everywhere and its gradient must be continuous wherever the 
potential is finite, so that the kinetic energy is well-defined. The Kato cusp conditions'*^ determine the behavior of the 
many-body wave function when two electrons, or an electron and a nucleus, are coincident. The cusp conditions derive 
from the requirement that the divergence in the local kinetic energy at a coalescence point cancels the divergence in 
the local potential energy. Failure to satisfy the cusp conditions leads to divergences in the local energy '^~^H'i>, 
where H is the Hamiltonian. These divergences are especially harmful in DMC calculations, where they can lead to 
population-control problems and significant biases. It is standard practice to use the Jastrow factor to enforce the 
cusp conditions. The Slater part of the wave function is chosen to satisfy the correct symmetry under exchange of 
electrons, and therefore the Jastrow factor should be symmetric under exchange. Indeed the Slater part of the wave 
function is normally chosen to have the correct symmetries of the state, so we should choose a Jastrow factor that 
does not change this symmetry. Finally, the Jastrow factor should allow rapid evaluation, as this is one of the more 
computationally demanding parts of VMC and DMC calculations. 

Our Jastrow factor is the sum of homogeneous, isotropic electron-electron terms w, isotropic electron-nucleus terms 
X centered on the nuclei, isotropic electron-electron-nucleus terms /, also centered on the nuclei and, in periodic 
systems, plane-wave expansions of electron-electron separation and electron position, p and q. The form is 

N-l N Niaas N Anions W-l N N-1 N N 

J({r,},{r,})= ^ ^ u(ry)+ 2 ^X7(r,/)+ £ 5Z fiini,rji,n,)+Y, P(i-'j) + 1] ^(r^), (2) 

1=1 1=1 i=l 1=1 i=l j=i+l i=l j=i+l i=l 

where N is the number of electrons, iVjons is the number of ions, Tij — Yi — Vj, and ru = — r/. Note that u, x, f, 
p, and q may also depend on the spins of i and j. Although we will present results using spin-dependent parameters, 
for compactness the spin-type has been suppressed in all formulae. The basic form is not novel, as terms of each type 
present in Eq. ^ have appeared in Jastrow factors in the literaturC)^^ but our particular forms of m, x/j // are 
new. 

The plane-wave term, p, will describe similar sorts of correlation to the u term. In periodic systems the u term 
must be cut off at a distance less than or equal to the Wigner-Seitz radius of the simulation cell (see Sec. IV Cp and 
therefore the u function includes electron pairs over less than three quarters of the simulation cell. The p term adds 
variational freedom in the "corners" of the simulation cell, which could be important in small cells. The p term can 
also describe anisotropic correlations, such as might be encountered in a layered compound. However, we expect that 
the u term will be considerably more important than the p term, which cannot describe the electron-electron cusps 
and is therefore best limited to describing longer-ranged correlations. The q term will describe similar electron-nucleus 
correlations to the xi terms. 



III. THE ELECTRON-ELECTRON AND ELECTRON-NUCLEUS CUSP CONDITIONS 



Imposing the cusp conditions on the Jastrow factor is non-trivial because the variables r^j, ru, and Vji are not 
independent. It is important to understand the meaning of the derivatives considered in this section. In Eq. ([3]), for 
example, the derivative d'^ / drij means the derivative with respect to rij with all other coordinates held fixed, while 
in Eq. ([5]) the derivative dJ /drij means the derivative with respect to with and Vj fixed. 



A. The antiparallel-spin electron— electron cusp condition 



Consider the situation where two electrons of opposite spin, i and j, approach one another and the wave function 
is non-zero at the two-particle coalescence point. This condition holds at almost all coalescence points of antiparallel- 
spin electrons. Let us omit the coordinates of all the other electrons and write the wave function in terms of the 
center-of-mass and difference coordinates of electrons i and j , f ^ = (r^ + r j ) /2 and —Vi — rj . The cusp condition- 
is 

5* \ 1 ^ 

/ rij=0 

where ^{fij,rij) is the spherical average of ^'(fy , r^j) about the coalescence point. 
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Neglecting the cuspless p and q terms, the Slater-Jastrow wave function may be written as 

^{Y,j,Y,j) = exp[J(r,,rj-,r,y)]i:>(ry,r,j), 



(4) 



where for clarity we have assumed there is only one nucleus, which is located at the origin. Consider the change in 
the value of ^' for a small displacement from the coalescence point such that the center of mass remains fixed: 



dJ_ 

dri 



dJ_ 

dri 



Sr, 



ri,=0 



dJ 



exp[J,^^=o] (V.,.D),,._o • r,, + ©(r^.), (5) 



where ^r^ and Srj arc the changes in and Vj when the electron separation r,j is increased from zero, and we have 
used Svj = —Svi. If the spherical average about the coalescence point is taken then the terms involving dri and 
vanish to 0{rij), so that 



Hence the antiparallel cusp condition is equivalent to the requirement that 

dJ \ 1 



dri 



2' 



(6) 



(7) 



where r^ , r^, and rj are treated as independent variables. 



B. The parallel-spin electron— electron cusp condition 



Suppose now that the approaching electrons i and j have parallel spins. The cusp conditior^i^ is 

fd^lrn\ 1 



(*lm)..,=0 : 



U / ri,-=0 



where ^'im is the rijYim component of and Yim is the (^,m)th spherical harmonic. 
Let us expand 5* about ry =0. D is an odd function of r^; hence we obtain 



* = exp[J,.^=o](l+(7r-| 



dJ 



dJ_ 

dr^ 



dJ_ 

dri 



(8) 



5r, + 0{rl) \ X ({V,,D)^_^ • r,, + 0{r%)) . (9) 



The change in the electron-nucleus distance when the electron separation r.^ is increased from zero is 5ri — 
rij cos{9i)/2 + 0{rfj), where 6i is the angle between and r^. The rijYim component of is therefore 

- exp[J,,^.^o] U^vD) ■ rJ x ( 1 + ( ^) r,, + ©(rf ) | , (10) 

where [^]i„j denotes the Yi™ component of X. So the parallel-spin cusp condition of Eq. ([8]) is equivalent to the 
requirement that 



dJ 

dra 



4' 



(11) 



where r^-, r^, and rj are treated as independent variables. 



C. The electron— nucleus cusp condition 



Now consider the cusp condition that must be satisfied as electron i approaches a nucleus of atomic number Z. The 
coordinates of all other electrons are omitted. The spherical average of ^'(ri) about the nucleus, ^{ri), must obeyi^ 



d^ 

dri 



ri=0- 



(12) 
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By similar arguments to those given for the antiparallel electron-electron cusp condition, if the Slater determinant is 
continuously diffcrcntiablc at the nucleus then the Jastrow factor must satisfy 



dJ_ 

dn 



-Z. (13) 



Note that if the Slater part of the wave function satisfies the electron-nucleus cusp condition, or if a non-divergent 
pseudopotential is used, then the Jastrow factor is required to be cuspless at the nuclei: it should satisfy Eq. (fT5)) 
with Z ~ Q. 

IV. THE BEHAVIOR OF THE LOCAL ENERGY AT COALESCENCE POINTS 
A. Continuity at antiparallel-spin coalescence points 

The Slater-Jastrow wave function in the vicinity of an antiparallel-spin coalescence point can be written as 

«'(ry) = cxp[M(r,j)]S'(r,j), (14) 

where S is the Slater wave function multiplied by the terms in the Jastrow factor that are analytic at the coalescence 
point and we have assumed there are no / terms in the Jastrow factor. 

Assuming that u satisfies the Kato cusp condition of Eq. ^ , the local energy can be shown to be 




Vis „ (V.,5),_ 







^Lo- ^ V + 0{n,), (15) 



where the Elq and —S'^V^S terms are continuous at the coalescence point. 

Satisfying the cusp condition removes the divergence in the local energy at the coalescence point, irrespective of 
the angle at which the electrons approach. However, the ©(r^ ) term in Eq. (|15p does depend on the direction of 
approach. The local energy therefore has a point discontinuity at antiparallel-spin coalescence points. This behavior 
is illustrated in Fig. [T] 

There is a similar discontinuity in the local energy at nuclei when the electron-nucleus cusp condition is enforced. 
If, on the other hand, the no-cusp condition is enforced at the center of a pseudo-atom, there is no discontinuity in 
the local energy. 

B. Continuity at parallel-spin coalescence points 

Now consider a parallel-spin coalescence point. Again the wave function may be written in the form of Eq. (|14p in 
the vicinity of the coalescence point, but this time S is an odd function of . If the Kato cusp condition of Eq. (fTTj) 
is satisfied by u, the local energy is 

£;l = -^-5|^) - ^^ELQ + 0{n,). (16) 

The —S^^Vf^S term is discontinuous at a parallel-spin coalescence point, giving a point discontinuity in the local 
energy. In spite of this, the local energy is continuous when one electron is moved along a straight line through another 
of the same spin because of the symmetry of the local energy with respect to exchanges of parallel- spin electrons. 
This behavior is illustrated in Fig. [51 
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FIG. 1: Local energy plotted against the position of an electron as it is moved along a straight line through another electron 
of the opposite spin in SiH4. The dotted line indicates the location of the stationary electron. 
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FIG. 2; Local energy plotted against the position of an electron as it is moved along a straight line through another electron 
of the same spin in SiH4. The dotted line indicates the location of the stationary electron. 



C. Further coalescence conditions 

Rassolov and Chipman^^ have demonstrated that, at the coalescence point of two antiparallel-spin electrons i and 

j, 

while for two parallel-spin electrons 



9rf, / 12 I drl / 48 



(18) 



These cusp conditions are difficult to apply to Slater-Jastrow wave functions because Eqs. (|17p and involve 
the Slater determinant as well as the Jastrow factor. If we assume that (d^S/drfA = 0, where S is the 

V ■'/rii=0 
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spherical average of S about an antiparallel-spin coalescence point, then we can derive an approximate condition on 
the antiparallel-spin u term in the Jastrow factor Likewise, if we assume that (d^Si^/dr^j)^ = 0, where 

is the rijYijn component of S about a parallel-spin coalescence point, then we can derive an approximate condition 
on the parallel-spin u term. Imposing these additional conditions was not found to be of any benefit in practice. 

The Rassolov-Chipman conditions can be derived by demanding that ^~^H^ and {'^ imrijYim)~^ H {^imfijYim) 
are cuspless at antiparallel- and parallel-spin coalescence points, respectively.— There seems little point in attempting 
to apply the Rassolov-Chipman conditions unless one has already ensured that the local energy itself is continuous at 
coalescence points. A continuous local energy can be achieved in a two-electron atom by using a trial wave function 
based upon the Fock expansionii^ However, it is unlikely that a practical method for eliminating the local-energy 
discontinuities in larger systems will be found. 



V. THE JASTROW FACTOR 



A. The u, Xi and / terms 



For the u term we use an expression which is a variation on the form we have used for a number of years^ and 



consists of a complete power expansion in up to order r^^^" which satisfies the Kato cusp conditions at Tij = 0, 
goes to zero at the cutoff length, = Lu, and has C — 1 continuous derivatives at L„: 



1=2 / 



(19) 



where 8 is the Heaviside function and F^ = 1/2 if electrons i and j have opposite spins and Fy — 1/4 ii i and j 
have the same spin. In this expression C determines the behavior at the cutoff length. If C = 2, the gradient of u is 
continuous but the second derivative and hence the local energy is discontinuous, and if C = 3 then both the gradient 
of u and the local energy are continuous. 

The form of x is similarly related to our earlier work. 



Xiini) = ini - L^i)^e{L^i - ru) X /3o/ + 



Z, 



ni 



(20) 



It may be assumed that /3„j/ = /3mj where / and J are equivalent ions. The term involving the ionic charge Zj 
enforces the electron-nucleus cusp condition. 

The expression for / is the most general expansion of a function of r^j , ru, and Vjj that is cuspless at the coalescence 
point and goes smoothly to zero when either ru or Vjj reach cutoff lengths: 

^rcN ArcN ATCC 
"fl "fl "fl 

fi{ru,r,j,n,) - ini - Lfjfe{Lfi - ru) x (r,, - LfifQ{Lfi - r,j) x ^ E E li^nnirlirj}r^,. (21) 

Various restrictions are placed on jimni- To ensure the Jastrow factor is symmetric under electron exchanges we 
demand that ^imni — Imini ^ I ,m,l,n. If ions / and J are equivalent then we demand that jimni — Jinmj- The 
condition that the / term has no electron-electron cusps is 



which implies that 



5/ 



= 0, 



(22) 



1=0 m=0 



(23) 



for aU ru- Hence, Vfc G {0, . . . , 2iVS^}, we must have 



(24) 



l,m : l-\-m—k 
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The condition that the / term has no electron-nucleus cusps is 

Of 



0, (25) 



which gives 

J2 J2^CjOmnI - Lfnimnl) X (-i//)^" V™+" (r,, - L f if - 0, (26) 



m=0 n=Q 

for all Tji. We therefore require that, Vfc' £ {0, . . . , Nf} + Nf^}, 

(C70mn/ - Lflllrani) = 0. (27) 

m.n : m+n—k' 

The method by which we impose the various constraints is described in detail in Appendix [A] 

B. The p and q terms 

The p term takes the cuspless form 

Pi'^vj ) ^^aA^cos{GA ■ r,j ) , (28) 

A G+ 

where the {G^} are the reciprocal lattice vectors of the simulation cell belonging to the ^th star of vectors that are 
equivalent under the full symmetry group of the Bravais lattice, and "+" means that, if Ga is included in the sum, 
—Ga is excluded. 

For systems with inversion symmetry the q term takes the cuspless form 

<z(r,) =^6s5Zcos(Gb T,), (29) 

where the {G^} are the reciprocal lattice vectors of the primitive unit cell belonging to the _Bth star of vectors that 
are equivalent under the space-group symmetry of the crystal, and the "+" means that, if G^ is included in the sum, 
—Gb is excluded. In this work q has only been used for systems with inversion symmetry. Plane- wave expansions of 
electron position can also be used for systems without inversion symmetry.— 

C. Cutting off terms in the Jastrow factor 

To avoid unwanted derivative discontinuities in the wave function of a periodic system, the isotropic functions u, x/; 
and // must be cut off at a distance less than or equal to the Wigner-Seitz radius of the simulation cell. Furthermore, 
reasons of efficiency dictate that in particular // should be cut off at short distances in both finite and periodic 
systems. Suppose we wish to evaluate a Slater-Jastrow wave function for a number of systems of increasing size, 
where the number of electrons N is assumed to be proportional to the number of ions, iVions- If the cutoS^ lengths 
L^, and L/ are chosen to be proportional to the size of the system then the numbers of operations required to update 
u and X/ after each electron move are 0{N). The cost of updating the // term is, however, proportional to 0{N'^), 
which is prohibitive in large systems. If we choose the cutoff lengths to be independent of the system size then each 
term can be updated in 0(1) operations. 

The // term describes inhomogeneous correlations that are spherically symmetric about atom /. It does not seem 
likely that // could describe the inhomogeneity in correlations at points distant from atom / in systems with many 
atoms. Similarly we argue that the long-ranged part of the spherically symmetric xi terms will not give useful 
variational freedom. For a fixed number of variable parameters we therefore expect that the best results will be 
obtained by cutting off the xi aiid // terms at distances of roughly the size of atom /, so that the variational freedom 
in these terms is concentrated at short distances where it is most useful. The u term must describe both long- and 
short-ranged correlations and therefore we expect it to be long-ranged. In our implementation we allow the cutoff 
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lengths L„, L^, and Lf to be varied, and these degrees of freedom are investigated in Sees. IVIIlHXIl In many cases 
the optimal value of L„ is approximately proportional to the system size and the optimal values of and Lf are 
approximately independent of the system size, so that overall the cost of updating the Jastrow factor after each 
electron move scales as 0{N). 

As mentioned earlier, the value of C in Eqs. (flQl) . ([20]), and ([2T|) determines the behavior of the Jastrow factor, 
and hence the local energy, at the cutoff lengths. Discontinuities in the local energy at the cutoff lengths may be 
harmful to optimization procedures, but the price paid for having a smoother local energy is a reduction in variational 
freedom. 



VI. FURTHER COMMENTS ON OUR JASTROW FACTOR 

We have used power series in the inter-particle distances rather than scaled variables, such as T'y /(I + brij), which 
have been used by Boys and Handyi^ and others . ^^'^^ These scaled variables go to a constant at large r^j, which is 
useful in finite systems. However, it is not clear whether scaled variables are helpful when the u, %, and / terms are 
cut off at finite lengths, as they must be in periodic systems. 

In our previous Jastrow factors^ we used Chebyshev polynomials rather than the powers themselves. The ideas 
behind this were that (i) the Chebyshev series may be calculated to very high accuracy with double-precision arithmetic 
using recurrence relations and (ii) the optimal coefficients tend to be of a more uniform magnitude, which could be 
helpful within optimization procedures. However, we have found that the precision offered by a simple power series 
with double- precision arithmetic is perfectly acceptable up to an order of at least 20, and we have found no clear 
benefits from the use of Chebyshev polynomials within our current optimization procedures either. We have therefore 
chosen to use simple power series, which may be evaluated more rapidly than the corresponding Chebyshev series. 

Our new Jastrow factor includes terms such as rih{rj, Tij), and rijg{ri, rj), where h and g are polynomials, which are 
absent in the Jastrow factors used by Schmidt and MoskowitzJii^ and some other researchers. Such terms do not in 
general satisfy the cusp conditions on their own, but certain linear combinations of them do and therefore they should 
be allowed to occur in the power series. The Jastrow factors used by Umrigar and coworkersiSi?"'^^ have included such 
terms for many years. We report tests of the importance of the terms neglected in the Schmidt-Moskowitz Jastrow 
factor in Sec. lyTITl 

As an option within our implementation we may try to reduce the extent to which / duplicates the u and x terms in 
the Jastrow factor by imposing the conditions 7oon7 — for all n and 7;oo/ — for all I. Note, however, that the terms 
of / with ^ = TO = do not exactly correspond to u: they are electron-electron terms local to ion /. This variational 
freedom, which is investigated in Sec. IVIIIl may be used to describe correlations that occur on two different length 
scales, for example, in the core and valence electrons of an atom or the intra- and inter-atomic electron correlations of 
weakly interacting atoms. The terms of / with m = n = are less likely to give useful variational freedom. However, 
the use of duplication of u and x by / does not appear to cause any difficulties within our optimization procedure, 
even where — Lf. 

The variable parameters appear linearly in our Jastrow factor, with the exception of the cutoff lengths, L„, L^j, 
and Lf]. The least-squares function in an unreweighted variance minimization is quartic in the linear parameters;^ 
however the dependence on the cutoff lengths is much more complicated, and they couple strongly to the other 
parameters. The use of linear parameters is found to be very advantageous in practice: up to ten times fewer Gauss- 
Newton iterations are required to converge to the minimum of the least-squares function when the cutoff lengths are 
fixed compared with when they arc optimized. 

Our Jastrow factor does not include logarithmic terms such as those motivated by the Fock expansion. ^^-^"^ Although 
these terms have been used in highly accurate Hylleraas-expansion calculations for two-electron atoms^ it should be 
noted that the most accurate calculations of this type performed to date have not included themf24 It cannot therefore 
be necessary to include the logarithmic terms to obtain high accuracy. 

Our Jastrow factor does not include terms involving three or more electrons. Of course the repulsive Coulomb 
interaction and the antisymmetry of the wave function ensures that three or more electrons rarely come close to one 
another, so that such terms are expected to be small, and explicit tests by Huang et al.'^^ support this view. 

In our implementation it is possible to use different u, /, and p functions for antiparallel, parallel spin-up, and parallel 
spin-down pairs of electrons, and different x and q functions for spin- up and spin-down electrons. We investigate the 
effect of using the different possible spin-dependences in the Jastrow factor for a partially polarized system in Sec. [X] 
Note that if different u functions are used for parallel- and antiparallel-spin pairs of electrons then both of the Kato 
cusp conditions are satisfied, but if the same u function is used for all pairs of electrons then only the antiparallel-spin 
cusp condition is satisfied. The use of a Jastrow factor that is not symmetric with respect to exchanges of electrons 
of opposite spin generally produces a trial wave function that is not an eigenfunction of the spin operator 5*^, even 
though the ground-state wave function must be an eigenfunction of 5*^. An investigation into this spin- contamination 
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effect has been carried out by Huang et al^, wfio found that highly optimized wave functions suffer from relatively 
little spin contamination. 



VII. SPECIFICATION OF THE JASTROW FACTOR AND THE MEASURE OF ACCURACY 



In the tests reported here the parameter C, which determines the behavior at the cutoff lengths, takes the values 
C = 2 or C = 3. The terms included in u are specified by Nu, those in x by N^, and those in / by N'j^ and Nj°. In 
each case is the same for all species of atom present, and likewise for N^^ and N^'^. Spin dependences in u, x, f, P, 
and q are specified hy Su, S^, Sf, Sp, and 5"^, where Su — denotes that the same u function is used for parallel- and 
antiparallel-spin pairs, S'„ = 1 denotes that different functions are used for parallel- and antiparallel-spin pairs, and 
Su = 2 denotes that different functions are used for parallel spin-up, parallel spin-down and antiparallel-spin pairs. 

= denotes that the same x function is used for spin- up and spin-down electrons while 5^ = 1 if they are allowed 
to be different, etc. Duplication of the terms in u and the terms in x by / is denoted by "D" = True. The 
terms included in the plane-wave expansions, p and q, are determined by the number of stars of G vectors included, 
Np and A'^,. If "SMJ" = True then only the / terms contained in the Schmidt-Moskowitz Jastrow factor are used 
(i.e., the terms proportional to and n are omitted). In each case we will specify the relevant descriptors and give 
the total number of optimized parameters in the Jastrow factor, Nt- The cutoff lengths are included in the count of 
parameters. 

Unless otherwise stated, the xi functions were chosen to be cuspless at the nuclei (i.e. Zj — in each case) because 
non-divergent pseudopotentials were used or, where the full Coulomb potential was used, the orbitals satisfied the 
electron-nucleus cusp condition. 

To initiate the optimization procedure one must select a set of configurations from a suitable probability distribution. 
We have found that the distribution obtained from the square of the Slater part of the wave function is normally 
an excellent starting point; indeed our results suggest that it may be preferable to the "self-consistent" approach of 
updating the distribution to include the latest estimate of the Jastrow factor. It should be noted that if one sets all 
of the variable parameters in our Jastrow factor to zero, the resulting wave function can be very poor, often giving 
energies which are higher than that obtained using the Slater part only. 

We measure the accuracy of a Jastrow factor by the percentage of the DMC correlation energy retrieved within 
VMC, i.e., 

rj = X 100% , (30) 

where -Ehf is the energy obtained with the Slater determinants only, -Evmc is the VMC energy obtained with the 
Slater-Jastrow wave function, and .Bdmc is the DMC energy. The DMC method gives the energy corresponding to 
a perfect Jastrow factor: see Sec. ID In this work the orbitals in the Slater determinants were kept fixed and we 
only optimized the Jastrow factor. Under these conditions i] is an appropriate measure of the accuracy of Jastrow 
factors. We also report the variance of the local energy, a"^, for each Jastrow factor tested. The energy variance is the 
quantity that determines the size of the statistical error bars for a given computational effort in QMC calculations. 
Furthermore, it is the object that we actually minimize when optimizing the Jastrow factor. 



VIII. EXAMPLE I: HE AND NE ATOMS 



A. Two-electron atoms 



Extremely accurate energies are available for the two-electron He and Ne^+ atoms from variational calculations 
using Hylleraas expansions^ and other methods. It is straightforward to show that the exact ground-state wave 
function of a two-electron atom is a nodeless function of ri, r2, and ri2. It should therefore be possible to obtain very 
accurate results by including /(ri, r2, ^12) terms in the Jastrow factor. As the ground-state wave function is nodeless, 
the DMC energy should equal the exact (non-relativistic and infinite-nuclear- mass) energy, apart from statistical errors 
and biases due to the use of finite time steps and populations. We used orbitals derived from numerical integrations 
of the Hartree-Fock (HF) equations on fine radial grids. 

Tables U and [TTl show variational energies of optimized Jastrow factors for He and Ne^+ . When using u and x 
functions only we obtain 91.17(2)% (He) and 93.64(11)% {'Ne^+) of the correlation energy, but when we add an / 
term we obtain nearly 100% of the correlation energy. Elimination of the terms of the forms rih{rj, rij) and rijg(ri, rj) 
leads to an expression containing the same powers as the Jastrow factor of Schmidt and Moskowitziiii^ The additional 
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TABLE I: Optimized Jastrow factors and VMC energies for He. The HF energy is —2.86167999 a.u., the exact energy is 
— 2.903724 a.u., and the DMC energy is within error bars of the exact value. In each case Su = Sf = Sx = 0, Nu = = Nu,x, 
and Nf = Nf = Nf^"". 
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TABLE IL Optimized Jastrow factors and VMC energies for Ne*+. The HF energy is —93.86111347 a.u., the exact energy is 
—93.906806 a.u., and the DMC energy is within error bars of the exact value. Duplication of u and x by / is permitted. In 
each case Su ^ Sf = ^ 0, iV„ = iV;, = Nu,x, and iV^'"^ = Nf = Nf'"". 



terms are unimportant in He and Ne^+. The results are not strongly dependent on whether C = 2 or 3, or whether 
duplication of the terms in u and x by those in / is prevented or not. 

Our results for Ne®^ are better than our results for He, both with and without the (r^, rj, Vij) terms in the Jastrow 
factor. This is to be expected, because the electron-electron interaction is a smaller perturbation in Ne®+, and hence 
correlation effects are less significant. 

Using a Jastrow factor consisting of a fourth-order Pade function of scaled variables, Umrigar et alX obtained a 
VMC energy of —2.903726(4) a.u. for He, so they were able to retrieve 100% of the correlation energy in this case. 
Making use of scaled variables, instead of cutting off the Jastrow factor at a finite range, would therefore appear to 
be beneficial in the special case of two-electron atoms. 

In general we find that the Jastrow factors which recover a large fraction of the correlation energy have a corre- 
spondingly low variance. However, the variance obtained for Ne^^ using "SMJ" = True is surprisingly high, even 
though the variational energy is about the same as the corresponding result in which the full variational freedom of 
/ is used. 

B. All-electron Ne and pseudo-Ne 

The results of optimizing different Jastrow factors for the all-electron Ne atom are given in Table Hill The importance 
of the / terms is clear: less than 60% of the correlation energy can be retrieved using only u and x, whereas more 
than 90% can be retrieved if / terms are used as well. 

We find that using C = 2 gives slightly better results than C = 3: it does not cause our optimization procedure any 
difficulties, and the extra variational freedom can be exploited in this case. The discontinuities in the local energy do 
not appear to cause any population-control problems for the DMC algorithm either. 

The optimal values of the cutoff lengths L„, L^, and Lf lie between 2 a.u. and 3 a.u. in most cases. In our best 
wave functions i„ is the longest of the three. Where x is absent, however, / has the greatest cutofi' length. We 
tried optimizing more than one / function in order to allow separate (r^ , rj , ) correlations for the core and valence 
electrons, but this did not lower the variational energy. 

If N^^ = NY = 2 and x is absent then it is important to allow / to duplicate u and x- 62(2)% of the correlation 
energy is retrieved when duplication is disallowed whereas 84(2)% is retrieved when duplication is permitted. However, 
the difference is far less pronounced when Nf^ — Nf = 3: about 85% is retrieved irrespective of whether duplication 



11 



C 


>I 


Sf D SMJ Nt 


-EvMC (a.u.) 


n 


cr| (a.u.) 


2 


8 











26 


-128.757(9) 


56(2)% 


3.17(6) 


2 





2 





F 


F 


23 


-128.781(9) 


62(2)% 


3.2(1) 


3 





2 





T 


F 


26 


-128.850(7) 


80(2)% 


2.14(6) 


2 





2 





T 


F 


26 


-128.863(7) 


84(2)% 


2.2(1) 


2 





3 





F 


F 


39 


-128.868(7) 


85(2)% 


2.03(1) 


2 


4 


3 





T 


T 


41 


-128.876(2) 


87.3(6)% 


1.92(1) 


2 





3 





T 


F 


44 


-128.877(6) 


88(2)% 


1.49(3) 


2 


4 


3 





T 


F 


49 


-128.886(2) 


90.0(6)% 


1.27(2) 


2 


4 


3 


1 


T 


F 


75 


-128.8983(2) 93.2(2)% 


1.12(2) 



TABLE III: Optimized Jastrow factors and VMC energies for all-electron Ne. The HF energy is -128.54709807 a.u., the 
exact energy is -128.9376 a.u. r^i^ and our DMC energy is -128.9238(7) a.u. In all cases Su = 1, = 0, Nu = 8, and 
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TABLE IV: Optimized Jastrow factors and VMC energies for pseudo-Ne. An HF pseudopotential was used to represent the 
Ne*+ ion,^ The HF energy is -34.6105 a.u. and the DMC energy is -34.9220(4) a.u. In all cases C = 2, = 1, = 5"/ = 0, 

Nu=N^ = 8, and Nf = Nf = Nf^"". 



is allowed. Using N^^ = NJ'^ = 2 and allowing duplication of u and x gives a more efficient parameterization of the 
Jastrow factor, for the number of parameters is substantially less than is the case when Nj^ = iV|° = 3. In these 
calculations the optimal cutoff length of / {Lf ~ 3.7a.u.) is greater than that of u {Lu — l.Oa.u.). Isolated atoms 
are a special case in which the x function can be long-ranged. In the absence of x, / is forced to be long-ranged so 
that it can describe the electron-nucleus correlations. Hence u, rather than /, has to describe all the short-ranged 
electron-electron correlations. 

We obtain slightly better results when we include the terms in / that are neglected in the Schmidt-Moskowitz 
Jastrow factor. The VMC energy is fairly insensitive to the spin-dependence of /. 

We have investigated whether it is better to include the electron-nucleus cusp in the Jastrow factor or in the orbitals 
in the Slater wave function. Calculations were carried out using orbitals expanded in a Gaussian basis set, generated 
by the crystal code.^^ The x term in the Jastrow factor satisfied the electron-nucleus cusp condition. The results 
obtained were significantly poorer than those shown in Table IIIII In order to get reasonable variational energies, a 
very large number of x parameters was required, with > 15. Even with N-^ = 15, only about 25% of the correlation 
energy was retrieved. It is clearly preferable to use orbitals that satisfy the electron-nucleus cusp condition. 

There is a significant fixed-node error in the DMC energy: our DMC energy is 0.0138(7) a.u. higher than the 
exact non-relativistic ground-state energy i^^i^^ We have verified that population-control biases are negligibly small 
and we have performed an extrapolation to zero time step, so the only remaining bias in our DMC energy is the 
fixed-node error. The best all-electron VMC energy reported in the literature is that of Huang et al.,^^ who optimized 
parameters in their orbitals at the same time as their Jastrow factor, giving them extra variational freedom, including 
the opportunity to reduce the fixed-node error. Using a Jastrow factor containing the same types of correlation as 
ours (electron-electron, electron-nucleus, and electron-electron-nucleus), and optimizing the orbitals as well as the 
Jastrow factor, Huang et al. obtain a VMC energy of —128.9008(1) a.u., which is only slightly lower than our best 
energy of -128.8983(2) a.u. 

The results of optimizing Jastrow factors for pseudo-Ne are shown in Table IIVI / is much less important in the 
pseudo-atom than in all-clcctron Ne. 86.3(5)% of the correlation energy is retrieved using u and x only, while 95.7(4)% 
is retrieved when / is used too. A greater fraction of the correlation energy can be retrieved in the pseudo-atom than 
in the all-electron atom. 
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TABLE V: Optimized Jastrow factors and VMC energies for SiH4. The HF energy is —6.118 a.u. and the DMC energy is 
-6.3064(2) a.u. In each case Su = 1, = S/ = 0, Nu = N^ = Nu.x, and Nf = iVf = Nf-"". Duplication of u and x by / 
is prohibited. 
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TABLE VI: Optimized Jastrow factors and VMC energies for pseudo-Ni. The HF energy is —38.6670 a.u. and the DMC energy 
is -39.2310(5) a.u. In each case C = 3, = = 8 and Nf = A^f = Nf'"". 



IX. EXAMPLE II: SIH4 MOLECULE 

We used a bond length of 2.8289 a.u. for the SiH4 (silane) molecule, in which the Si"'+ ion was represented by 
a relativistic HF pseudopotential^° and the full Coulomb potential was used for the hydrogen nuclei. The orbitals 
forming the Slater determinants were obtained from HF calculations using a large Gaussian basis set and the GAUSSIAN 
codej^ 

Results for some of the Jastrow factors tested for SiH4 are given in Table |Vl We find that a large fraction of the 
correlation energy can be obtained using rather simple Jastrow factors. Using u and x functions only, and with a 
total of only 7 parameters, we are able to obtain almost 90% of the correlation energy. Our best Jastrow factors 
obtain about 93(1)% of the correlation energy. We find the optimal cutoff lengths (L„ ~ 10 a.u. and L^si — L^u ~ 5 
a.u.) to be fairly independent of and N^. There is no detectable benefit from going beyond = = 4, or from 
introducing / functions. Both the results obtained and the behavior of the optimization procedure are very similar 
for C = 2 and C = 3, so that in this case there is no benefit from having a continuous local energy. 

X. EXAMPLE III: NI ATOM AND NIO DIMER 

We investigated the Ni atom and the NiO dimer with a bond length of 3.075 a.u., using HF pseudopotentials^^ to 
represent the Ni^°+ and 0®+ ions. The orbitals were obtained from HF calculations using a large Gaussian basis set 
and the crystal codcj^ We find that the / functions are significant for both the Ni atom (Table IVI[) and the NiO 
dimer (Table |VlT| in spite of the use of pseudopotentials. 

Note that Ni and NiO are partially spin-polarized, so that it may be advantageous to have different x functions 
for spin-up and spin-down electrons, and different u and / functions for parallel spin-up and parallel spin-down pairs 
of electrons, unlike the other systems studied in this work. Our results show that the spin-dependences of the u, x, 
and / functions do indeed have a significant effect on the quality of the wave functions for Ni and NiO, although 
including {ri,rj,rij) terms in the Jastrow factor has a greater effect. An additional 1-2% of the correlation energy 
can be retrieved when x and u are allowed to differ for spin-up and spin-down electrons. Using different / functions 
for antiparallel, parallel spin-up, and parallel spin-down pairs also lowers the variational energy slightly, although it 
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jyeN.ee g^^^ j^^ EyMC (a.U.) 7] G% (a.U.) 

1 35 -54.9031(5) 69.99(8)% 2.144(4) 

1 1 51 -54.9069(5) 70.45(8)% 2.132(4) 

3 1 77 -54.9984(4) 81.31(7)% 1.521(3) 

3 1 1 157 -55.0104(4) 82.74(8)% 1.451(2) 

3 2 1 165 -55.0105(4) 82.75(8)% 1.439(2) 



TABLE VII: Optimized Jastrow factors and VMC energies for the NiO dimer. The HP energy is -54.31362 a.u. and the DMC 
energy is -55.1558(6) a.u. In each case C = 2,, Su ^ I, = S f = S^j, = = 8, and Nf = Nf = Nf-"". Duplication 
of u and x by / is forbidden. 
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TABLE VIII: Optimized Jastrow factors and VMC energies for pseudo-Si (16-atom simulation cell). The HF energy is 
—7.63946 a.u. per primitive cell and the DMC energy is —7.90600(6) a.u. per primitive cell. In each 

= Sf ^ Sg = 0, Nu = = Nu,x, Nf = Nf = N'f'''°, and Np = Nq = Np,q. Duphcation of u and x by / is allowed. 



greatly increases the number of parameters which have to be optimized. 

These calculations are the only ones for which we have retrieved less than 90% of the correlation energy. 

XI. EXAMPLE IV: SI SOLID 
A. 16-atom simulation cell 

We modeled crystalline Si in the diamond structure using a 16-atom, face-centered cubic simulation cell subject to 
periodic boundary conditions. We used a cubic lattice constant of 5. 12966 a.u., and the Si'*"'" ions were represented by 
pseudopotentials.^° The orbitals were obtained from HF calculations using a large Gaussian basis set and the crystal 
codei^ The results of optimizing the our Jastrow factor are shown in Table IVllIl 

adjusts itself to sizes of the order of the inter-atomic spacing (L^ c:± 6.4 a.u., whereas the nearest-neighbor distance 
is 4.4424 a.u.), while L„ tends to the largest possible value, which is the Wigner-Seitz radius of the simulation cell 
(7.2544 a.u.). 

It is much easier to optimize the cutoff lengths when C — 3 than when C — 2. It seems that the discontinuities in 
the local energy that are present when C = 2 cause serious problems for our optimization procedures. For example, 
a flexible Jastrow factor with C — 2 gives a lower variance, but higher energy, than a simple Jastrow factor with 
C = 3. Such problems were not apparent in our calculations for atoms and small molecules. The discontinuities do 
not appear to lead to any problems within DMC calculations, however. 

The use of the p and q terms does not bring about a statistically significant lowering of the VMC energy. The 
optimal value of is a little less than the Wigner-Seitz radius when p terms are included, so the p term must 
describe the long-ranged correlations, as expected. The sinusoidal p and q functions are considerably more expensive 
to evaluate than the polynomial u and x functions, so it is anticipated that p and q will rarely be used in practice, 
except in strongly anisotropic systems such as graphite, where Prendergast et alM' have demonstrated that plane- wave 
expansions in the Jastrow factor have an important role to play. 
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B. 54-atom simulation cell 

Similar calculations to those reported in Sec. IXIAl were carried out using a 54-atom simulation cell. The HF energy 
is —7.6792 a.u. per primitive cell and the DMC energy is —7.9555(2) a.u. per primitive cell. The VMC energy obtained 
using our Jastrow factor with 7V„ = Nx = 4, S'„ = 1, and = (giving 14 free parameters) is —7.9331(6) a.u. per 
primitive cell, so 91.9(2)% of the correlation energy is retrieved. A very similar fraction of the correlation energy is 
retrieved in the 16-atom cell (see Table IVlII| . The VMC energy variance is 2.92(5) a.u., so the variance per electron is 
about the same as for the 16-atom simulation cell. The optimal value of L„ is again equal to the Wigner-Seitz radius 
of the simulation cell (10.882 a.u.). The optimal value of remains of order the atomic size, at ~ 4.7 a.u. These 
results indicate that cutting off the x function at sizes of order the inter-atomic spacing is a valuable improvement to 
the Jastrow factor. 



XII. CONCLUSIONS 

We have developed and tested a form of Jastrow factor consisting of electron-electron (u), electron-nucleus (x/), and 
electron-electron-nucleus (//) terms, and additional clcctron-position-dcpendent [q) and electron-electron-separation- 
dependent (p) terms. The u, xii fi terms are expanded in polynomials and are forced to go to zero at some 
cutoff radii. The p and q terms are expanded in plane waves. We have tested our Jastrow factor on atoms, molecules, 
and solids, including both all-electron and pseudopotential atoms. In most cases our VMC calculations have retrieved 
over 90% of the fixed-node correlation energy. 

The variable parameters appear linearly in our Jastrow factor, except for the cutoff radii of u, x/i s-^d //■ The 
linearity in the variable parameters aids the computational efficiency of the optimization algorithm. We have found 
that it is often beneficial to make the local energy continuous at the cutoffs when optimizing the cutoff radii, but that 
a lower variational energy can be achieved in some cases when discontinuities are allowed at the cutoffs. 

We have investigated the importance of terms in the Jastrow factor that were neglected by Schmidt and 
Moskowitz^iiS but we found them to be unimportant in the systems studied here. 

The electron-electron-nucleus // terms were found to be important in all-electron simulations of the He, Ne®"'', and 
Ne atoms. The // terms are generally less important for pseudo-atoms than all-electron atoms. For example, they 
account for nearly 40% of the correlation energy in all-electron Ne, but only about 10% in pseudo-Ne. We found the 
// terms to be significant for pseudo-Ni and the NiO dimer, but they had little effect in SiH4 or crystalline Si. The 
plane-wave terms p and q were found to be unimportant in crystalline Si. 

We have found that it is preferable to use orbitals which satisfy the electron-nucleus cusp condition and to require 
the Jastrow factor to be cuspless at nuclei rather than to enforce the electron-nucleus cusp condition through the 
Jastrow factor. 

Overall we found that optimizing the cutoff radii is very important. In crystalline Si the cutoff for u adjusted itself 
to the largest possible value, which is the Wigner-Seitz radius of the simulation cell. This indicates that there are 
significant correlations in Si extending over many atoms. We have argued that the long-ranged parts of the xi a-nd // 
terms do not give useful variational freedom, and in support of this we found that the corresponding cutoffs adjusted 
themselves to sizes of the order of the inter-atomic spacing. 

Although we have optimized our Jastrow factors by minimizing the variance of the energy, we have measured the 
accuracy of the Jastrow factors in terms of the variational energy itself. If we were to minimize the variational 
energy directly then we might get even better results. Recently there has been much interest in developing energy- 
minimization methods for optimizing trial wave functions, "^^"'^'^iSi and we intend to pursue this avenue further. 
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APPENDIX A: IMPOSING THE CONSTRAINTS ON 7 

Consider a particular set of ions / and a particular spin-pair type. To impose the symmetry of / under interchange 
of electrons we work with 'jimni, where / > to, and then complete the 7 array by setting 'yimni = Jmini for I < to. 
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Let X be a vector whose components are each of the 7 coefBcients with I > m. The remaining constraints (no-cusp 
conditions and, optionally, no-duplication-of-u-and-x conditions) may then be written in matrix form as Ax. — 0. The 
total number of constraints determines the number of rows of A: 

• There are 2Nj^ + 1 constraints (one for each value of fc = ^ + m) associated with the imposition of the electron- 
electron no-cusp condition. Using the symmetry of 7, Eq. (|24p can be rewritten so that only elements of x are 
involved: Vfc £ {0, . . . , 27V|^}, 

For each fc, this equation defines a row of A. 

• There are Nj^ + Nfj + 1 constraints ( one for each value of fc' = ^ + n) associated with the imposition of 
the electron-nucleus no-cusp condition. Eq. (|27p can be rewritten so that only elements of x are involved: 

Vfc'e{0,...,iV|N + iVf,}, 

Cjook'I - LfiJiok'I + Y i'^llOnl - Lfijiini) = 0. (A2) 



For each k', this equation defines a row of A. 

• If desired, there are NJ'f constraints imposed to prevent duplication of u. (700^/ = Wn.) 

• If desired, there are iV^j constraints imposed to prevent duplication of x- iliooi — 

Imposing the constraints reduces the number of independent variable parameters by rank(^). The matrix A is 
transformed into its row-reduced echelon form by Gaussian elimination, which allows us to identify a suitable set of 
independent variable parameters. 
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